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The univariate piecing-together approach (PT) fits a univariate generalized Pareto distribution 
(GPD) to the upper tail of a given distribution function in a continuous manner. We propose 
a multivariate extension. First it is shown that an arbitrary copula is in the domain of attraction 
of a multivariate extreme value distribution if and only if its upper tail can be approximated by 
the upper tail of a multivariate GPD with uniform margins. 

The multivariate PT then consists of two steps: The upper tail of a given copula C is cut 
off and substituted by a multivariate GPD copula in a continuous manner. The result is again 
a copula. The other step consists of the transformation of each margin of this new copula by 
a given univariate distribution function. 

This provides, altogether, a multivariate distribution function with prescribed margins whose 
copula coincides in its central part with C and in its upper tail with a GPD copula. 

When applied to data, this approach also enables the evaluation of a wide range of rational 
scenarios for the upper tail of the underlying distribution function in the multivariate case. We 
apply this approach to operational loss data in order to evaluate the range of operational risk. 

Keywords: copula; domain of multivariate attraction; GPD copula; multivariate extreme value 
distribution; multivariate generalized Pareto distribution; operational loss; peaks over 
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1. Introduction 

The peaks over threshold approach (POT) shows that the upper tail of a univariate dis- 
tribution function F can reasonably be approximated only by that of a generalized Pareto 
distribution (GPD). This result goes back to Balkcma and de Haan [2] and Pickands [30]. 
A univariate GPD W is derived from an extreme value distribution (EVD) G by the 
equality 

W{x) = l + \og(G(x)), l/e<G(x), 
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where, with a shape parameter a > 0, the family of standardized EVD is given by 

Gi,a(x) = exp(— x~ a ), x>0, 

G 2 , a (x)=exp(-(- a ;) Q ), a;<0, (1) 
G 3 (x) = exp(-e _x ), xeR, 

being the Frechet, (reverse) Weibull and Gumbel case of an EVD. 
The family of univariate standardized GPD is, consequently, given by 

Wi, a (x) = l-aT a , x>l, 
W 2 , a (x) = l-(~x) a , -l<x<0, 
W 3 (x) = l-exp(-x), x>0, 

being the Pareto, beta and exponential case of a GPD. 

If X is a univariate random variable with distribution function F, then the distribution 
function F^ x °' of X , conditional on the event X > xq, is given by 



F [xo] (x)=P(X<x\X>x ) 

F(x) - F(x ) 
l-F(x ) ' 



x>xo, 



where we require F(xo) < 1. The POT approach shows that i^ x °] can reasonably be ap- 
proximated only by a GPD with appropriate shape, location and scale parameter W 7i/ii<7 . 
Note that 

F(x) = (1 - F(x ))F^(x) + F(x ) 

w (1 - F(x ))W ltfl , a {x) + F(x ), X > x . 

The piecing together approach (PT) now consists in replacing the distribution function F 
by 

p* („\ — J x<x , (9 s 

\(1-F(x ))iy 7 ^(x)+F(x ), x>x , {Z) 

where the shape, location and scale parameters 7, fj., a of the GPD are typically estimated 
from given data. This modification aims at a more precise investigation of the upper end 
of the data. 

Replacing F in (2) by the empirical distribution function F n of n independent copies 
of X offers in particular a semi-parametric approach to the estimation of high quan- 
tiles F^ 1 (q) = mi{t £ R: F(t) > q} outside the range of given data; see, for example, 
Section 2.3 of Reiss and Thomas [33]. 

In this paper we propose an extension of the PT in (2) to higher dimensions. When 
applied to data, this approach also enables the evaluation of a wide range of rational 
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scenarios for the upper tail of the underlying distribution function in the multivariate 
case. This will be exemplified in Section 4 for operational loss data, where we simulate 
different scenarios for risk parameters such as the value at risk or the expected shortfall. 
In Section 2 we provide the basic mathematics for our PT approach. We will show that 
an arbitrary copula can reasonably be approximated in its upper tail only by a GPD 
with uniform margins. 

The multivariate PT approach, which will be established in Section 3, now consists of 
two steps: 

(i) The upper tail of a given m-dimensional copula C is cut off and substituted by 
the upper tail of multivariate GPD copula in a continuous manner such that the result 
is again a copula. 

(ii) The other step consists of the transformation of each margin of this new copula 
by a given univariate distribution function F* , 1 < i < m. 

This provides, altogether, a multivariate distribution function with prescribed mar- 
gins F* , whose copula coincides in its central part with C and in its upper tail with 
a GPD copula. In Section 4.2 we will simulate the effects that the combination of uni- 
variate and multivariate PT has on quantile functions and mean excess functions or, in 
terms of risk analysis, on value at risk and expected shortfall. It turns out that in our 
specific model, which will be specified in Section 4, the application of the multivariate 
PT approach leads to a rising expected shortfall while the value at risk keeps (up to 
a level of 99.9%) nearly unchanged. 

Instead of fitting a GPD to the upper tail of a distribution, estimation of rare events in 
the multivariate case can also be based on the fact that the exponent measure pertaining 
to a multivariate GPD is homogeneous; see de Haan and Sinha [II] and de Haan and 
Rondc [9] for details. 

For recent accounts of basic and advanced topics of extreme value theory and statistics, 
see the monographs by Reiss and Thomas [33], de Haan and Ferreira [10] and Resnick [34]. 

2. Multivariate GPD 

In this section we provide the mathematics underlying our PT approach, which will be 
established in Section 3. 

Let F be an arbitrary m-dimensional distribution function that is in the domain of 
attraction of an m-dimensional EVD G; that is, there exist norming constants a„ > 0, 
b n € K m such that 

F n (a„x + b„) -> G(x), xeM m , (3) 

n— >oo 

where all operations on vectors are meant componentwise. The distribution function G 
is max stable; that is, there exist norming constants c„ > 0, d„ € M. m with 

G"(c n x + d„) = G(x), xgR™. 

The one-dimensional margins Gi of G are up to scale and location parameters univariate 
EVD in (1). 
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It is well known that (3) is equivalent with convergence of the univariate margins 
together with convergence of the copulas 

hm C^(u 1 /") = C G (u)=G(Gr 1 («i),...,G,- 1 K i )), uefO.lf, (4) 

n— >-oo 

(Deheuvels [12, 13], Galambos [19]). For a recent account on copulas, see Nelsen [29]. 
Some elementary computations as in Falk [16], Section 6, or de Haan and Ronde [9], 
Section 4.2, entail that convergence (4) is equivalent with 

hmi(l-C' F (l + tx))=/ G (x):=-log(C G (exp(x))), x < 0, (5) 

where Iq is known as the stable tail dependence function introduced by Huang [22]. For 
a detailed discussion of the stable tail dependence function, see Beirlant et al. [3] . The 
stable tail dependence function is homogeneous ^g( x ) = ^G^x), t > 0, and, thus, (5) 
becomes 

l-C F (l + tx)-l G (tx) 
>uo0. 

Observe that ?g( x ) = 1 — x < 0, where H is a multivariate GP function with 

uniform margins Hi(x) = 1 + x, x < 0, i < to; that is, 

ff(x) = l+log(G(x)), x<0, 

and G is a multivariate EVD with negative exponential margins Gi(x) = exp(x), x < 0, 
i to. 

We call in general an TO-dimensional distribution function W a multivariate GPD if 
its upper tail coincides with a GP function; that is, there exist a multivariate EVD G 
and a vector Xo £ R m with G(xo) < 1 such that 

W(x) = l + log(G(x)), x>x . (6) 

Note that H(x) = 1 + log(G(x)), G(x) > 1/e, does not define a distribution function 
unless to £ {1,2}; see Michel [27], Theorem 6. We, therefore, call H a GP function. It 
is, actually, a quasi-copula (Alsina et al. [1], Genest et al. [20] and Section 5.1 in Falk 
et al. [17]). Lemma 5.1.5 in Falk et al. [17] implies, on the other hand, that for any GP 
function there exists a distribution function W satisfying (6). 

The preceding considerations together with elementary computations entail now the 
following characterization of domains of attraction in terms of a GPD. By || • || we denote 
an arbitrary norm on R m . 

Theorem 2.1. An arbitrary distribution function F is in the domain of attraction of 
a multivariate EVD G if and only if this is true for the univariate margins and if there 
exists a GPD W with ultimately uniform margins Wi(x) = 1+x, xq < x < 0, i < to, such 
that 

<My) = W(y-l) + o(||y-l||) 

uniformly for y £ [0, l] m . 
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We have the following equivalences for an arbitrary copula C to lie in the domain of 
attraction of an EVD. 

Corollary 2.2. C is in the domain of attraction of an EVD G 

<=^ There exists a GPD W with ultimately uniform margins such that 

C(y) = W(y-l) + o(||y-l||), 

uniformly for y G [0, l] m . In this case W(x) = 1 + log(G(x)) ; x < x < G M m . 
There exists a norm \\ ■ \\d on R m such that 

C(y) = l-||y-l||z> + o(||y-l|| D ), 
uniformly for y G [0, l] m . In this case G(x) = exp( — ||x||_d) , x < 0. 

Recall that all norms on M. m are equivalent and, thus, o(||y — 1\\d) in the second 
equivalence above can be substituted by o(||y — 1||) with an arbitrary norm || • || on E m . 

The preceding results show that the upper tail of the copula Cf of a distribution 
function F can reasonably be approximated only by that of a GPD W with ultimately 
uniform margins. To the best of our knowledge, this provides new insight into the signif- 
icance of multivariate GPD. But it is in accordance with Rootzen and Tajvidi [35], who 
showed that, in the multivariate case, modelling exceedances of a random variable over 
a high threshold can rationally be done only by a multivariate GPD. 

Proof of Corollary 2.2. It is well known that a GPD W with ultimately uniform 
margins can be written as 

W(x) = l-\\x\\ D , x <x<0, 

where || • ||d is a norm on R m with particular properties, called a D-norm; see Section 4.4 
in Falk et al. [17]. In particular, G(x) = exp(-||x||x>), x < 0, defines an EVD on K m . If 
C(y) = W(y- l)+o(||y- 1||), y G [0, l] m , for some norm || ■ || on W m , then 

C "{ 1 + l) = ( 1_ ^ l|y||l5+0 (« llyll )) rl 

exp(-||y|| D ) = G(y), y < 0. 

n— >oo 

Together with Theorem 2.1 this implies Corollary 2.2. □ 

In the final equivalence of Corollary 2.2, the norm can obviously be computed as 

,. 1-Gfl + ix) ,. , 
||x|| D =lim K - t J - = l{x), x<0: 

that is, it is the stable tail dependence function. It turns out that any stable tail de- 
pendence function is actually a norm. This explains why it is a convex function and 
homogeneous of order one. 
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Example 2.3. Take an arbitrary Archimedean copula 

C v {u) = ip~ x {ip{ui) H h <p(u m )), 

where the generator ip : (0, oo) — > [0, oo) is a continuous function that is strictly decreasing 
on (0, 1], (p(l) = 0, lim x j,o ip(x) = oo and <p~ 1 {t) = infja- > 0: <p{x) <t}, t> 0. 

Note that C v is not automatically a copula for each function ip: (0,oo) — > [0,co) as 
above. While, in the bivariate case m = 2, convexity of ip^ 1 is a necessary and suf- 
ficient condition, this is no longer true in higher dimension m > 3. Instead, C v is, 
for general dimension m > 2, a copula if and only if tp _1 is differcntiablc up to order 
m - 2, the derivatives satisfy (-l) fc (< i 3~ 1 )( fc ) (a;) > 0, k = 0, ...,m — 2, a; 6 (0,oo) and 
further if (— l) m_2 ((p _1 )( m_2 ) is non-increasing and convex in (0,oo); see McNeil and 
Neslehova [24], Theorem 2.2. 

If ip is differcntiable from the left in x = 1 with left derivative cp' (1— ) / 0, then 

l-C„(l + tx) = £ N = ||x||i> x < „. 

z<m 

that is, each Archimedean copula with a generator ip as above is in the domain of at- 
traction of the EVD G(x) = exp(— ||x||i), x < 0, with independent margins. The margins 
of C v are, therefore, tail independent; that is, the tail dependence parameters vanish: 

X(i,j) ■= limPffj > x | Uj > x) = 0, 1 < i ^ j < m, 

where the random vector (U\, . . . , U m ) follows the distribution function C v . For a dis- 
cussion of the tail dependence parameter and further literature, see Section 6.1 in Falk 
et al. [17]. 

The preceding considerations concern, for example, the Clayton and the Frank cop- 
ula, which have generators ipc{t) = i9 -1 (i~' 5 — 1) and (pp(t) = — log((exp(— M) — 1)/ 
(exp(— ■&) — 1)), -d > 0, but not the Gumbcl copula with parameter A > 1, which has 
generator <p G {t) = -(log(t)) A , A > 1, < t < 1. 

Any multivariate EVD G has univariate EVD margins and any multivariate GPD W 
has univariate GPD margins in its upper tail. We can transform an arbitrary multivariate 
EVD to an EVD with negative exponential margins by just transforming the margins. 
Equally, we can transform an arbitrary W to a GPD with uniform margins by just trans- 
forming the margins. This transformation can also be done backwards; see Section 5.6 
in Falk et al. [17]. We will, therefore, consider in what follows multivariate GPD derived 
from an EVD G with negative exponential margins. For a recent account on multivariate 
GPD, see Michel [25]. 

From the de Haan-Resnick-Pickands representation of a multivariate EVD, it is well 
known that a function G on (— oo,0] m is the distribution function of an EVD with 
negative standard exponential margins Gi{x) = exp(a:), x < 0, i < m, if and only if it can 
be represented as 

G(x)=exp( / min(xjtj)/x(dt) I , x < 0, 
\Js m J 
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where fi is a finite measure on S m ■= {t > 0: ^2 i<m ti = 1}, called an angul 



ar mea- 



sure, with the characteristic property J s ti/i(dt) = 1, i <m; see Section 4.2 in Falk 
et al. [17]. Note that this integrabihty condition on fj, implies that fJ,(S m ) = J s ld// = 

Js m Ei< m ^M(dt) = Ei<™ J Sm ^M(dt) = m. 

As a consequence we obtain that a multivariate GPD W with standard uniform margins 
1 — Wi(x) = x, i< m, in a left neighborhood of £ R m can be represented as 

W(x) = 1+1 ) / max(a; l t i )^(dt) 

=: 1+ f ^ Xj jD(xi,...,x m -i) 

for x <x< 0, where fi is as above, Xi=Xi/J2j< m %j and D: {u£ [0, l] m_1 : Y^,j< m -l w j — 
1} — > [1/m, 1] is a Pickands dependence function (Section 4.3 in Falk et al. [17]). 

The following result characterizes a GPD with uniform margins in terms of random 
variables. It provides an easy way to generate a multivariate GPD, thus extending the 
bivariate approach proposed by Buishand et al. [6] to an arbitrary dimension. Recall that 
an arbitrary multivariate GPD can be obtained from a GPD with ultimately uniform 
margins by just transforming the margins. For a recent account on simulation techniques 
of multivariate GPD, see Michel [26]. 

Proposition 2.4. 

(i) Let W be a multivariate GPD with standard uniform margins in a left neighbor- 
hood of £ R m . Then there is a random vector Z = (Z\, . . . , Z m ) with Zi £ [0, m] and 
E(Zi) = 1, i<m, and a vector ( — 1/m, . . . , —1/m) < xo < such that 

^x)=p(-^,..,£)<x), x <x<0, 

where the random variable U is uniformly distributed on (0, 1) and independent of Z. 

(ii) The random vector —U(l/Zi,...,l/Z m ) follows a GPD with standard uniform 
margins in a left neighborhood of £ K m if U is independent of Z = (Z\, . . . , Z m ) and 
< Zi < Ci a.s. with E(Zi) = 1, i <m, for some Ci,...,c m >l. 

Note that the case of a GPD W with arbitrary uniform margins Wi(x) = 1 — aiX in 
a left neighborhood of with arbitrary scaling factors > 0, i < m, immediately follows 
from the preceding result by substituting Zi by aiZi. 

Proof of Proposition 2.4. First we establish part (i). From representation (7) we 
obtain that for x in a left neighborhood of £ R m 



«r W - 1+ (j>)/ 



max(x^j)/i(dt) 



with some measure /i on S m such that /J,(S m ) = m and J s £j/x(dt) = 1, i < m. 
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Now /}(•) = jti(-)/m defines a probability measure on S m . Let T= (Ti,...,T m ) be 
a random vector with values in S m that has distribution /} and put Z := mT. Then 
Z e [0, m] m and £(Zj) = f Sm i,/i(dt) = 1, i < m. We have, further, for x < e R m with 
> — 1/m, j < m, 

= p( -III ) <mx 

= jT p(-C/(i <mx|T = t)(P*T)(dt) 

p(-tf(i,...,£)<mx)A(dt) 



— / pf [/ > mmax(— Xit^^/zfdt) 

— / 1 + to N a;,- ) max(xjfj)it(dt) 



\j<1T» 



= 1+1 J / max(iiij)/i(dt). 



\7<m 

This implies part (i) of the proposition. 

On the other hand, we have for x < and large s > 

V u \z 1 , '" , z m )~s x 



[ p(u>-m&x(-x l z i ))(P*Z)(dz) 
J[o,c] V s J 

1--/ max(-a; 4 z l )(P*Z)(dz) N ) 

s J[0,c] / 

expl — / max(— Xj^i)(P * Z)(dz) 

3 V J\0,c\ 



=: G(x) 

with c = (ci,...,c m ). 
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Lemma 7.2.1 in Reiss [32] now implies that G is a distribution function that is obviously 
max stable: G s (s~ 1 x) = G(x), s > 0; that is, G is a multivariate EVD and has negative 
standard exponential margins Gi(x) = exp(xE(Zi)) = exp(x), x < 0. As a consequence, 
1 + log(G(x)) is a GP function with 

1 + log(G(x)) = 1 - / max(-XiZi){P * Z)(dz) 

J[0,c] l < m 




for Xq < x < and some Xq < 0. □ 



Let, for instance, C be an arbitrary m-dimcnsional copula; that is, C is the distribu- 
tion function of a random vector S with uniform margins P(Si < s) = s, s£(0,l),i<m, 
(Nelscn [29]). Then Z := 2S is a proper choice in part (ii) of Proposition 2.4. Proposi- 
tion 2.4, therefore, maps the set of copulas in a natural way to the set of multivariate 
GPDs, thus opening a wide range of possible scenarios. 

According to Theorem 2.1, we call a copula Cw a GPD copula on [0,1]™ or simply 
a GPD copula if there exists yo < 1 such that 

CW(y) = W(y - 1), y <y<l, 

where W is a GPD with standard uniform margins in a left neighborhood of zero. 

For mathematical convenience we temporarily shift a copula to the interval [—1,0]™ by 
shifting each univariate margin by —1. Thus we obtain a distribution function Cw from 
a GPD copula Cw-, whose marginal distribution functions are the uniform distribution 
on [—1,0], and Cw coincides close to zero with a GPD W as in equation (7); that is, 
there exists Xo < such that 

CV(x) = W(xi, . . .,x m ) 

= l+(Vx 7 )/ maxU 4 — U(dt), xe[x ,0]. 

Because Cw inherits its properties from the original GPD copula Cw , we call Cw a GPD 
copula on [— 1, 0] m . 

For later purposes we remark that a random vector V G [— l,0] m following a GPD 
copula on [— l,0] m can easily be generated as follows, using Proposition 2.4. Let U be 
uniformly distributed on (0,1) and independent of the vector S = (Si, . . . , S m ), which 
follows an arbitrary copula on [0, l] m . Then we have for i <m 

Un' 

= :H{x), 



if- 2 <z<0, 
1 

:fx<--, 
x<0, 
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and, consequently, 



with 




follows by Proposition 2.4 a GPD copula on [—1,0] 



3. Multivariate piecing together 

The multivariate PT approach consists of two steps. In a first step, the upper tail of 
a given m-dimensional copula C is cut off and substituted by the upper tail of multi- 
variate GPD copula in a continuous manner. The result is again a copula, that is, an 
m-dimcnsional distribution with uniform margins. The other step consists of the trans- 
formation of each margin of this copula by a given univariate distribution function F* , 
1 < i < m. This provides, altogether, a multivariate distribution function with prescribed 
margins F* whose copula coincides in its central part with C and in its upper tail with 
a GPD copula. 

We start with fitting a GPD copula to the upper tail of a given copula C on [—1,0]™. 
Recall that for mathematical convenience we shift any copula C(u), u G [0, l] m , to a cop- 
ula on [-l,0] m by setting C(v) = (7(1 + v), vG [-1,0]™. 

Let V = (Vi, . . .,V m ) follow a GPD copula on [-1,0]™; that is, P(V t <x) = l + x, 
— 1 < x < 0, is for each i <m the uniform distribution on [—1,0], and there exists Xo = 
(xq , . . . , Xq ) < such that for each x = (xx, . . . , x m ) G [xo, 0] 

P(V<x) = l+(^x^(^-,...,^-), 

where D is a Pickands dependence function. 

Let Y = (Yi, . . . , Y m ) follow an arbitrary copula C on [—1, 0]™ and suppose that Y is 
independent of V. Choose a threshold y = (yi, . . . ,y m ) G [— 1,0]™ and put 

Qi := Yil(Y t <y t ) - yiVil(Yi> yi ), i < m- (9) 

The random vector Q then follows a GPD copula on [—1,0]™, which coincides with C 
on Xj< m [— 1,2/J. This is the content of the main result of this section. 

Proposition 3.1. Suppose that P(Y > y) > 0. Each Qi defined in (9) follows the uni- 
form distribution on [—1,0]. The random vector Q = (Qi, . . . , Q m ) follows a GPD copula 
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on [— l,0] m , which coincides with C on Xi< m [— 1, yi\; that is, 

P(Q<x) = C(x), x<y. 

We have, moreover, with Xi G [max(j/i,:Eo ),0], * < TO ; / or an y non-empty subset K of 
{l,...,m} 

P(Q, > ar^i G K) = P(Vi > b l:K x u ieK), 

where 

p(Y J>y] ,jeK) p(Y j>yj ,jeK) 
hK - = P(r l>2/l ) e(0 ' 1] ' ieK - 

Proof. First wc show that each Qi follows the uniform distribution on [—1,0]. We have 
for — 1 < x < yi 

P(Qi < x) = P(Q, < x, Y, < yi ) + P{Qi < x, Y % > w ) 
= P(Yi < x) 
= 1+x, 

whereas for yi < x <0 we obtain 

P(Qi < x) = P(Y t < Vi ) + Pi-ViVi < x)P{Y t > Vi ) 

= i + y i + p(v i <-^{-y i ) 

= l +tt+ (l -£)(-«) 
= l+x. 

The random vector Q, thus, follows a copula on [—1,0]™. We have, further, for x < y 

P(Q < x) = P(Q < x, Y < y) + P(Q < x, Y ^ y) 
= P(Y < x) 
= C(x). 

By Proposition 2.1 in Falk and Michel [18] we have with Xi G [max(i/i,Wi),0], i <m, 
t G [0, 1] and an arbitrary subset K C {1, . . . , m} 

P(Qj > txj,j G K) = P{Qj > tx^Yj > y h j G K) 

= P{ yXj > txjj G K)P(Yj > Vj ,j G K) 
= tP(- yj Vj > xj , j G K |P| )', > yj ,j G K) 
= tP(Q j >x j ,jeK), 

which, again by Proposition 2.1 in Falk and Michel [18], implies that Q follows a GPD. 



12 



S. Aulbach, V. Bayer and M. Falk 



We have, moreover, with xi <E [max(j/j, Xq^), 0], i<m, 
P(Qi >Xi,i£K) 

= P{Qi >x ll Y i >y l ,ie K) + P(Qi >Xi,i£ K, Y 3 < y 3 - for some j g K) 
= P{~y l V i > Xi,i£ K)P(Yi > y h ie K) 

= p(v i >-^,i€KjP(Y i >y i ,i€K) 

= P(Vi>b it KXi,ieK). □ 



The above approach provides an easy way to generate a random vector X £ M m with 
prescribed margins F* , i <m, such that X has a given copula in the central part of the 
data, whereas in the upper tail it has a GPD copula. 

Take Q = (Qi, . . . , Q m ) as in (9) and put Q := (Qi + 1, . . . , Q m + 1). Then each com- 
ponent Qi of Q is uniformly distributed on (0,1) and thus 

X := (X x , ...,X m ):= (FrHQi),- ■ ■ , KaHQm)) (10) 

has the desired properties. 

Combining the univariate and the multivariate PT approach now consists in choosing 
a threshold u(i) S M for each dimension i <m and a univariate distribution function Fi 
together with an arbitrary univariate GPD W 7iifli!Ti , and putting for i <m 

F*( x ) •= / ^i(x), ifa;<u(i) 
t(l-fi(u(ip W( (i) + F<(«(<)). ifx>u( l )- 

This is typically done in a way such that F* is a continuous function. 



4. An application to operational loss data 

In this section we apply our multivariate PT approach to operational loss data. For 
an excellent introduction to operational risk and insurance analytics, see Chapter 10 of 
McNeil et al. [23] and the literature cited there. In the sequel we give a brief summary. 

According to the New Basel Capital Accord (Basel II), banks are required to de- 
termine the regulatory capital charge for operational risk, defined as the risk of losses 
resulting from inadequate or failed internal processes, people and systems or from exter- 
nal events. The Basel Committee on Banking Supervision encourages the use and further 
development of advanced modelling techniques to quantify operational risk. The most 
risk-sensitive methodology is the loss distribution approach using bank internal data to 
estimate probability distribution functions for each business line/event type category. 
To provide a greater consistency of loss data collection within and between banks, op- 
erational losses are classified in eight business lines and seven event types. To calculate 
the capital charge for each business line/event type combination, a risk measure such 
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as value at risk to the 99.9% confidence level over a one-year holding period is chosen. 
A conservative way to assess a bank's total capital requirement is to sum up the capital 
charges across business line/event type classes assuming perfect dependence and disre- 
garding diversification effects in operational risk. Actually, the fact that all severe losses 
occur in the same year is rather dubious. Therefore, the dependence structure among 
losses of different business line/event type categories needs to be modelled explicitly. For 
simplicity, we consider in what follows only business lines and not event types. 

The frequency of a loss event for business line i over a one-year time horizon will be 
denoted by N(i). The random loss associated with the kth loss event for business line i 
will be denoted by Cfe(*)- 

The random loss L(i) over one year for business line i is, therefore, modelled as 

N(i) 

k=l 

where CiW^W; ■ • ■ are assumed to be i.i.d. with distribution function Fi and they are 
independent of their total number N(i). 

The goal is to model the total loss distribution for operational risk; that is, the distri- 
bution of 

m 

£:=X>(0> 

i=l 

or parameters of it such as the value at risk VAR(a) at the probability level a satisfying 

P(L > VAR(a)) = 1 - a 
or the expected shortfall at the probability level a 

ES(a) := E(L \ L > VAR(a)). 

In order to assess the total capital charge, the traditional models for measuring opera- 
tional risk determine VAR(a) and ES(a) for each of the m business lines separately and 
then simply sum up the corresponding capital charges. 

In contrast, Di Clemente and Romano [14] suggest modelling the dependence structure 
among L(l), . . . , L(m) by a copula function, precisely, by the copula corresponding to the 
m-dimensional f-distribution with v degrees of freedom. For a closer look at the issue of 
modelling the dependence among components of a random vector of financial risk factors 
using the concept of a copula, see Chapter 5 of McNeil et al. [23] . 

In our application we analyse operational losses of the external database SAS OpRisk 
Global Data, which contains worldwide information on publicly reported operational 
losses over US $100000. Since we do not know the probability of losses lying under 
US $100000, we neglect this cut-off limit in modelling the severity and frequency of the 
data. We concentrate on two business lines of the financial sector, Commercial banking 
and Retail banking. 
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First we follow the copula extreme value theory approach for modelling operational 
loss data as outlined in Di Clcmcnte and Romano [14], but we add the multivariate PT 
approach developed in Section 3. 

4.1. Estimation 

Before applying the multivariate PT approach, we explore the characteristics of the 
empirical distributions of the two business lines and estimate the model's parameters. 
Thereby the severity distribution of the random variable Cfc(i) and the frequency dis- 
tribution of the random variable N(i), i = 1,2, arc treated separately. To obtain the 
distribution function of the total loss L(i) of the business line i over a one-year time 
horizon we accomplish a Monte Carlo simulation combining the severity distribution 
with the frequency distribution. 

First we analyse the empirical distributions of the loss severity. The measures skewness 
and kurtosis indicate that the empirical distributions of the two business lines are skewed 
to the right and very heavy tailed. 

In the next step, parametric distributions (i.e., Weibull, gamma and lognormal distri- 
bution) are fitted to the data. The parameters are estimated by the maximum likelihood 
method. With the help of graphical analysis (QQ plots, theoretical versus empirical dis- 
tribution function plots) and goodness-of-fit tests (Anderson-Darling test, Cramer-von 
Mises (CvM) test), we conclude, that none of the selected distributions provides a good 
fit to the complete data sets. (For a detailed presentation and discussion of goodness-of-fit 
techniques, see D'Agostino and Stephens [8].) However, the lognormal distribution fits the 
body of the data very well, while it underestimates the severity of the data in the right tail. 

Therefore, to fit the tail data accurately, the univariate POT method in the model of 
the severity distribution of the losses Cfc(*) is applied: The existence of a threshold u(i) for 
each business line i is assumed such that £fc(*) follows a lognormal distribution function 
below u(i), whereas above u(i) it follows a univariate GPD, that is, 

P(Ct(i)<aO 

Fi(x), x<u(i), 
Fi(u(i)) + (1 - Fi(u(i))) GPB mm (x - «(*)), x > u(i), 

where the GPD is given by 

GPD pm u(z) := 1 - (l + J , z > 0, 

with shape and scale parameter £(£) > 0, (3(i) > 0. Furthermore, F{ is defined as Fi(x) := 
$((log(;r) — fj,(i))/a(i)), where $ is the standard normal distribution function and € 
R, a(i) > are location and scale parameters of the lognormal distribution. 
In this case we obtain for x > u(i) 

p(a« > x) = p(c*(o > u(0) (i + m^f> 
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Table 1. Estimated model parameters 





a(i) 


r(i) 


fi(i) 




u{i) 


0(0 




Commercial banking 


0.74 


46.10 


2.19 


2.23 


918.02 


609.84 


0.82 


Retail banking 


0.39 


162.04 


0.88 


2.06 


69.18 


99.75 


1.02 



Table 2. Estimated 


correlation matrix for 


the t 


copula 




Commercial 




Retail 


Commercial 


1 




0.76 


Retail 


0.76 




1 



The threshold u(i) is chosen with the help of mean excess plots. The shape and scale 
parameters of the GPD are estimated by the maximum likelihood method. For a dis- 
cussion of the parameter estimation of a GPD and optimal choice of the threshold, see 
Section 6 of Embrechts et al. [15]. 

To determine the frequency distribution of the random variable N(i), the Poisson and 
negative binomial distribution are fitted to the total number of losses per year. The 
parameters of these distributions are estimated by the method of moments. Since the 
negative binomial distribution has two parameters, a and r, it is more flexible and often 
provides a better fit to operational loss data than the Poisson distribution; see Cruz [7] , 
page 89. With the help of the \ 2 goodness-of-fit test, this expectation is confirmed. 
Therefore, the random variable N(i) is modelled by the negative binomial distribution, 
whose probability mass function is expressed as 

with a(i) > 0, r(i) > 0. The resulting estimates of the model's parameters are given in 
Table 1. 

In the following, we model the dependence structure among L(l) and L(2) by a cop- 
ula function. The assumption of a normal copula has been quite popular in finance for 
modelling the dependence between different risks, but it puts less weight on observations 
that are large in each component; see, for example, Rachev et al. [31]. The t copula is 
more heavily tailed and, therefore, better suited for modelling operational risk. 

In our bivariate case the t copula is fitted to the total loss data over a one- year time 
horizon. The parameter of the correlation matrix and the degrees of freedom v are es- 
timated by the maximum likelihood method. For a discussion of the problem of fitting 
copulas to data, see Section 5.5 of McNeil et al. [23]. 

Table 2 contains the estimated correlation matrix for the t copula with v = 8.64 esti- 
mated degrees of freedom. 
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Table 3. Goodness-of-fit test for the t copula 



CvM statistic 


p- value 


0.02543851 


0.521978 



To evaluate the goodness of fit of the t copula, the CvM test is applied. For recent 
reviews of copula goodness-of-fit testing, see Berg [4] and Genest et al. [21]. In Table 3 
the CvM test value and corresponding p-value are reported. 

Since the p- value is 0.521978, the null hypothesis that the dependence structure of the 
data follows a t copula is not rejected. 

4.2. Simulation 

In the previous section we estimated the parameters that specify a situation where only 
the univariate PT approach is applied. This model is similar to the one described in Di 
Clemente and Romano [14] . Now we show by simulations how popular risk measures such 
as value at risk or expected shortfall can be influenced by the combination of univariate 
and multivariate piecing together. 

First we take the t copula derived in Section 4. land add the multivariate PT approach 
developed in Section 3. We simulate 10 4 independent copies of Y = (Yi, Y2) and V = 
(Vl, V2), which follow the t copula from above and a GPD copula on [— 1, 0] 2 , respectively. 
The realizations of Y := Y 1 and V are then combined with those of a random vector Q 
according to definition (9). The distribution of Q is then a GPD copula on [— 1, 0] 2 which 
coincides with the previously mentioned t copula - shifted by minus one - below some 
threshold vector y = (2/1,2/2)- The last step consists in shifting the realizations of Q to 
the interval [0,1] 2 and transforming the margins by F-j*,^* according to equation (10). 
(F£,F2 are derived from Monte Carlo simulations as described in Section 4.1.) Thus we 
obtain 10 4 realizations of a random vector X that follows a multivariate distribution 
function that has the marginal distribution functions F* , F£ and the associated copula 
is a GPD copula that coincides with the original t copula in its central part. These 
realizations of X are then taken to compute the empirical counterparts of the value at 
risk and the expected shortfall. 

Before we apply these steps, we remark that there are still two remaining degrees 
of freedom in our model: the GPD copula that underlies V and the copula threshold 
vector y. Our goal in this section is to give a first insight into the consequences of replacing 
the upper tail of a given copula with a GPD copula. (Note that this procedure is justified 
by Theorem 2.1 and Corollary 2.2.) For this purpose, we assume a simple model: 

(i) We define the GPD copula underlying V indirectly by setting Z := 2S in Propo- 
sition 2.4, where S follows a bivariate normal copula. 

(ii) The copula threshold vector is obtained by y := (Fi(u(l)), i*2(it(2))) — 1; that is, 
the thresholds for the marginal distributions in the univariate PT approach (see equa- 
tion (12)) are transformed and used for the multivariate PT approach, too. 
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This way, we construct a parametric model that is only dependent on the correlation 
matrix 




ee[o,i), 



of the random vector S. It is typically very difficult, particularly in higher dimensions, to 
find a good multivariate model that describes both marginal behavior and dependence 
structure effectively. The advantage of the preceding copula model is that it depends on 
just one parameter g G [0,1), allowing dependence and independence of the margins in 
a simple and continuous manner. We refer again to McNeil et al. [23]. 

The simulations as described above were now done for various values of g. To attain 
more reliable estimates for the value at risk and the expected shortfall, we simulated 
not only once but 50 times and took the average. Additionally, these values were also 
computed for the case in which only the univariate PT approach is applied and the t 
copula kept unchanged. This procedure allows us to identify the effect the multivariate 
PT approach has on the mentioned risk measures. 

In the following we state our results from the simulation series with g = 0.7, which 
models the case of relative high dependence but not complete dependence between the 
business lines. Although a graphical analysis of the used GPD copulas suggested that the 
degree of dependence in the upper tail was increasing with g getting larger, there was no 
observable trend in the estimates of the value at risk and the expected shortfall. Further 
research is necessary to derive criteria for the optimal choice of g or, more generally, of 
the GPD copula underlying V and the copula threshold y. 

We now start with the presentation of the results. For simplicity, we identify the 
business lines Commercial banking and Retail banking with the cases i = 1 and i = 2, 
respectively. The random vector X from above models the combined random losses for 
these two business lines over one year, that is, X = (L(l), L(2)). The value at risk and the 
expected shortfall of L(l), L(2) and L = L(l) + L(2) were computed using their empirical 
counterparts 

VAR(a) = F~ 1 (a), 

where F is the empirical distribution function of 1/(1), L(2) or L, respectively, and 

-~- 1 ™ 

ES(a) = nd-a) ^'' 1 ^)^^' 

where k is the ith realization of L(l), L(2) or L, respectively. By 1b we denote the 
indicator function of a set B, that is, 1b(x) = 1 if x G B and 1b(x) = otherwise. 

Table 4 gives the means of 50 independent simulations resulting from the multivariate 
PT approach, whereas Table 5 makes use of the univariate approach only. Since the 
marginal distributions are the same in both cases, namely F^F^, the value at risk 
estimates concerning L(1),L(2) nearly coincide across both tables. It is apparent that 
the respective values for L, the total loss, are only slightly different. 

On the other hand, the empirical expected shortfalls attain clearly higher values if the 
upper tail of the t copula is substituted by a GPD. This behavior is independent of the a 
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Table 4. Estimated value at risk and expected shortfall, GPD copula 



YAR(q) ES(q) 
95% 99% 99.5% 99.9% 95% 99% 99.5% 99.9% 



Lil) 13638 32899 49650 159442 46436 154758 270036 1038077 
L(2) 12 586 45370 84386 390127 93 365 381142 702350 2880672 
L 26578 75518 127042 533701 135581 512781 930472 3746889 



Table 5. Estimated value at risk and expected shortfall, t copula 



VAR(q) ES(a) 



a 


95% 


99% 


99.5% 


99.9% 


95% 


99% 


99.5% 


99.9% 


L(l) 


13638 


32 667 


49196 


153 322 


37674 


111075 


182956 


608 755 


L(2) 


12 590 


45 601 


83414 


392 673 


71288 


270 821 


482058 


1774252 


L 


25428 


75 674 


131267 


533710 


105122 


366904 


637184 


2261 144 



level and holds for the losses in the single business lines as well as for the total loss in our 
simulation. This is remarkable since the corresponding estimates for the value at risk are 
in both cases nearly the same, indicating that there are some extreme high losses that 
occur very rarely. 

Clearly, with underlying estimated GPD shape parameters £(1) = 0.82 and £(2) = 1.02, 
the theoretical expected shortfall exists only for i = 1 but not for i = 2. The significant 
increases for ES(a) in line £(1) and L(2) in Table 4 should, therefore, only be due to 
the high volatility of the empirical expected shortfall, whereas the significant increase in 
line L should be caused by the substituted GPD copula as well. This example of real 
operational loss data might be considered as a warning, not to underestimate the effects 
of rare events that nevertheless might occur simultaneously. 

In cases of no existing theoretical expected shortfall, Moscadelli [28] suggests the risk 
measure median shortfall (MS) that is defined regardless of the values of the shape 
parameter £. If the respective distribution function is continuous, the median shortfall 
has the representation 

MS (a) = VAR 

sec Biagini and Ulmer [5], pages 749-750. In addition to our previous results, Table 6 
states the estimates for the median shortfall, which we computed as MS (a) = VAR((1 + 
a)/2). 

Unlike the expected shortfall, the median shortfall as a robust measure is not as heavily 
influenced by extreme values in the upper tail. Therefore, the median shortfall estimates 
for L(l), L(2) are closer to each other than the expected shortfall estimates if we compare 
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Table 6. Estimated median shortfall 







MS(a) 


t copula 






MS(q), 


GPD copula 




a 


95% 


99% 


99.5% 


99.9% 


95% 


99% 


99.5% 


99.9% 


L(l) 


19829 


49196 


78678 


243 938 


19829 


49650 


80 292 


313 246 


L(2) 


21494 


83414 


162438 


793 252 


21600 


84386 


162866 


782 938 


L 


40340 


131267 


234910 


962458 


42463 


127042 


229 260 


1085 283 



1 1 

500000 



1500000 2500000 3500000 



500000 



1 500000 2500000 3500000 



Figure 1. 10 4 random deviates of (L(l), L(2)) based on the original t copula (left) and on the 
GPD copula (right). 



the t copula case with the GPD copula case. However, the values for the median shortfall 
of L at the 99.9% level do clearly differ, indicating a small number of extreme high losses 
in the upper tail - in accordance with our considerations on the expected shortfall, see 
page 18. 

To give visual insight into these results, Figure 1 compares the realizations of X = 
(L(1),L(2)) graphically. The graphics were taken from a single simulation that con- 
tributed to the results in Table 4 and Table 5. Although the t copula itself is already 
heavily tailed, the substitution of its upper part by a GPD puts even more weight on 
observations that are very high in both components. (Recall Corollary 2.2.) The latter 
type of modelling, therefore, represents a higher risk of an extraordinarily high total loss 
over a one-year time horizon. This can be seen on the point in the upper right corner in 
the right scatterplot of Figure 1, which represents a Active total loss of 6275 000, whereas 
the highest total loss in the respective pure t copula scenario is 3313 000. 

5. Conclusions 

In the present paper we extended the well known univariate PT approach to higher 
dimensions. This was motivated by Theorem 2.1 and Corollary 2.2, which show that it is 
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not sufficient to apply the univariate approach to the marginal distributions of a random 
vector if the upper tail of its distribution is to be modelled adequately. It is, therefore, 
necessary to approximate the underlying copula by a multivariate GPD. 

The multivariate PT approach that was introduced in (9) offers a wide range of sce- 
narios to be modelled because it depends basically only on some random vector whose 
components need to be bounded and to have expectation one; see Proposition 2.4. As 
a consequence we also mentioned a natural way to map the set of copulas to the set of 
multivariate GPDs that was useful for our simulation studies to obtain a one-parametric 
model. 

Because the values in a simulation are random, simulations occur that produce no 
values that are high in both components. (This is depending on the sample size, too.) 
Fixing this disadvantage is subject to further research and could probably be achieved 
by making additional restrictions on the GPD copula random vector in (9). Furthermore, 
goodness-of-fit testing of the compound GPD copula is required. 

Nevertheless, the multivariate PT approach is a powerful and suitable tool to ad- 
equately model multivariate distributions in their upper tails. This ensures that the 
probability of very rare events that occur simultaneously and have a high effect if they 
occur is not underestimated. The high empirical expected shortfalls in Table 4 might be 
considered as a warning. 
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